CloudKon: a CLOUD-enabled distributed tasK executiON framework
نویسندگان
چکیده
— Task scheduling and execution over large scale, distributed systems plays an important role on achieving good performance and high system utilization. Job management systems need to support applications (e.g. Many-Task Computing – MTC, MapReduce) with a growing number of tasks with finer granularity due to the explosion of parallelism found in today's hardware which requires techniques such as over-decomposition to deliver good performance. Our goal in this work is to provide a compact, lightweight , scalable, and distributed task execution framework (CloudKon) that builds upon cloud computing building blocks (Amazon EC2, SQS, and DynamoDB). Most of Today's state-of-the-art job execution systems have predominantly Master/Slaves architectures, which have inherent limitations, such as scalability issues at extreme scales and single point of failures. On the other hand distributed job management systems are complex, and employ non-trivial load balancing algorithms to maintain good utilization. CloudKon is a distributed job management system that can support millions of tasks from multiple users delivering over 2X the performance compared to other state-of-the-art systems in terms of throughput – all with a code-base of less than 5%. Although this work was motivated by the support of MTC applications, we will outline the possible support of HPC applications as well.
منابع مشابه
An Effective Task Scheduling Framework for Cloud Computing using NSGA-II
Cloud computing is a model for convenient on-demand user’s access to changeable and configurable computing resources such as networks, servers, storage, applications, and services with minimal management of resources and service provider interaction. Task scheduling is regarded as a fundamental issue in cloud computing which aims at distributing the load on the different resources of a distribu...
متن کاملCloudKon Reloaded with Efficient Monitoring, Bundled Responses, and Dynamic Provisioning
In today's world the emphasis is on distributed systems which plays an important role on achieving good performance , high system utilization and scalability. Task scheduling and execution over large scale, distributed systems plays an important role on achieving good performance and high system utilization. Due to the explosion of parallelism found in today’s hardware, applications need to per...
متن کاملDistributed Scheduling and monitoring service leveraging FaBRiQ as a building block for CloudKon+
In today’s world, the scientific community is moving towards distributed systems which plays an important role on achieving good performance and scalability. Task scheduling and execution over large scale, distributed systems plays an important role on achieving good performance and high system utilization[15]. Most of todays state-of-the-art job execution systems are centralized architectures,...
متن کاملScalable Resource Management in Cloud Computing
The exponential growth of data and application complexity has brought new challenges in the distributed computing field. Scientific applications are growing more diverse with various workloads, including traditional MPI high performance computing (HPC) to fine-grained loosely coupled many-task computing (MTC). Traditionally, these workloads have been shown to run well on supercomputers and high...
متن کاملGreen Energy-aware task scheduling using the DVFS technique in Cloud Computing
Nowdays, energy consumption as a critical issue in distributed computing systems with high performance has become so green computing tries to energy consumption, carbon footprint and CO2 emissions in high performance computing systems (HPCs) such as clusters, Grid and Cloud that a large number of parallel. Reducing energy consumption for high end computing can bring various benefits such as red...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013